QSAR Classification Model for Antibacterial Compounds and Its Use in Virtual Screening

نویسندگان

  • Narender Singh
  • Sidhartha Chaudhury
  • Ruifeng Liu
  • Mohamed Diwan M. AbdulHameed
  • Gregory J. Tawa
  • Anders Wallqvist
چکیده

As novel and drug-resistant bacterial strains continue to present an emerging health threat, the development of new antibacterial agents is critical. This includes making improvements to existing antibacterial scaffolds as well as identifying novel ones. The aim of this study is to apply a Bayesian classification QSAR approach to rapidly screen chemical libraries for compounds predicted to have antibacterial activity. Toward this end we assembled a data set of 317 known antibacterial compounds as well as a second data set of diverse, well-validated, non-antibacterial compounds from 215 PubChem Bioassays against various bacterial species. We constructed a Bayesian classification model using structural fingerprints and physicochemical property descriptors and achieved an accuracy of 84% and precision of 86% on an independent test set in identifying antibacterial compounds. To demonstrate the practical applicability of the model in virtual screening, we screened an independent data set of ~200k compounds. The results show that the model can screen top hits of PubChem Bioassay actives with accuracy up to ~76%, representing a 1.5-2-fold enrichment. The top screened hits represented a mixture of both known antibacterial scaffolds as well as novel scaffolds. Our study suggests that a well-validated Bayesian classification QSAR approach could compliment other screening approaches in identifying novel and promising hits. The data sets used in constructing and validating this model have been made publicly available.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unify QSAR approach to antibacterial activity of organic drugs against different species

There are many different kinds of pathogen bacteria species with very different susceptibility profile to different antibacterial drugs. One limitation of QSAR models are the biological activity of drugs against only one bacteria species. In previous paper we develop one unified Markov model to describe the biological activity of different drugs tested in the literature against some of the anti...

متن کامل

Molecular Modeling on Pyrimidine-Urea Inhibitors of TNF-α Production: An Integrated Approach Using a Combination of Molecular Docking, Classification Techniques, and 3D-QSAR CoMSIA

Molecular docking, classification techniques, and 3D-QSAR CoMSIA were combined in a multistep framework with the ultimate goal of identifying potent pyrimidine-urea inhibitors of TNF-α production. Using the crystal structure of p38α, all the compounds were docked into the enzyme active site. The docking pose of each compound was subsequently used in a receptor-based alignment for the generation...

متن کامل

Estimation of the applicability domain of kernel-based machine learning models for virtual screening

BACKGROUND The virtual screening of large compound databases is an important application of structural-activity relationship models. Due to the high structural diversity of these data sets, it is impossible for machine learning based QSAR models, which rely on a specific training set, to give reliable results for all compounds. Thus, it is important to consider the subset of the chemical space ...

متن کامل

A lazy learning-based QSAR classification study for screening potential histone deacetylase 8 (HDAC8) inhibitors.

Histone deacetylases 8 (HDAC8) is an enzyme repressing the transcription of various genes including tumour suppressor gene and has already become a target of human cancer treatment. In an effort to facilitate the discovery of HDAC8 inhibitors, two quantitative structure-activity relationship (QSAR) classification models were developed using K nearest neighbours (KNN) and neighbourhood classifie...

متن کامل

In Silico Studies toward the Discovery of New Anti-HIV Nucleoside Compounds through the Use of TOPS-MODE and 2D/3D Connectivity Indices. 2. Purine Derivatives

The TOPological Substructural MOlecular DEsign (TOPS-MODE) approach has been used to predict the anti-HIV activity in MT-4 assays (Estrada et al., 2002) of a diverse range of purine-based nucleosides. A database of 206 nucleosides has been selected from the literature and a theoretical virtual screening model has been developed. The model is able of discriminating between compounds that have an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 52 10  شماره 

صفحات  -

تاریخ انتشار 2012